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ABSTRACT 



SUBM is a CONVERT program, realized in the CTSS LISP of 
Project HAG, for constructing the subset machine vith the sane 
behaviour as a given transition system* The proRran interactively 
collects the six items defining a transition system: its state set f 
alphabet, transition function, initial states, accepting states and 
spontaneous transitions. It then conputes the subset machine, producing 
its state set, transition function, initial state and accepting states. 



A transition system Is a variant of the concept of a finite 
state machine which very often provide! a more economical and convenient 
notation. One of tho principal uses of a transition system is as an 
intermediate step in the process of constructing a finite state machine 
which will accept precisely those words designated by a given regular 
expression, since there is a very straightforward process by which the 
transition systems corresponding to individual regular expressions nay 
be combined to correspond to the concatenation, union, and iteration 
of regular expressions; the three processes from which compound regular 
expressions are formed from simple regular expressions, 

A transition system is essentially a graph, Tho nodes of tho 
graph represent the states of the transition system, while the directed 
arrows connecting the states correspond to input words , and represent 
the transitions^ of the system. Arrows representing all possible words 
are not drawn, but only those representing individual letters or the 
null word. The combinations of these primitive arrows yield composite 
arrows. 

In addition to tho graph, there is specified a subset of the 
states called the initial states as well as another subset, called the 
accenting states . The uso of the transition system is to distinguish 
those words for which there exists at least one path from an initial 
state to on accepting state from thoso for which there docs not, 

A finite state machine may also be represented by a graph, but in 
contrast there is only one initial state, and there is exactly one arrow 
emerging from each node for each letter of tho alphabet, and no arrows 
corresponding to the null word, By contrast there may be several arrows 
or none corresponding to s given letter ODerging from a node of a transition 
system. 

The construction of a finite state machine equivalent to a transition 
system consists in taking as states of the MChiM subsets of states of the 
transition system. By taking as the image of a machine state the set of 
all possible states of the transition system derivable by the same letter 
from some state of the argument set, one ensures that there will bo one 
imago and honce one arrow for each letter of the input alphabet. Of course, 
the image set could be the empty set. 



It then remains to define the initial and accepting states 
appropriately so that both the transition system and its subset machine 
make exactly the sane distinctions of input words. The Initial state 
of the subset machine is conveniontly taken as the set of initial states 
of the transition system, together with any states derivable from then 
by Hans of the null word. The accepting states are those subsets which 
contain one of the accepting states of the transition system. 

Whereas the subset machine of an n-stato transition system would 
contain 2 n states, one is really only interested in the connocted 
component of the subset ouchinc, and the connected component nay contain 
considerably less than this number of states. 

The formal definition of a finite state machine, or automaton, is 
that it is a quintuple 

S ■ <S, t, M, a, F> 

S is the set of states, generally carrying the same name 
t is the input alphabet 

M: S-E * S is the transition function 

acS is tho initial state 

FCS is the sot of accepting states. 

The formal definition of a transition system is that it is a 
sextuple, 

T - *T, I, N, A, 0, P> 

T is the set of states, generally carrying the sane name 

Z is tho input alphabet 

N: Tx£ + 2 is the transition function 

Act is the set of initial states 

OCT is the set of accepting states 

PCT»T is the spontaneous transition relation. 

In the usage of transition systems one requires the following further 
definitions. 

P is tho reflexive and transitive closure of P. 
|s, o, t| ; There is a transition from s to t mediated 
by o: when o - a , ( s ,t) c P; otherwise 
when a - *o , | s , „, k | ^nd qfiNCk.o) and 
(l.tjc P, for some k and q. 



The subset construction itsolf consists In forming the following 
machine, Riven the transition system T» 

S..2 T 

t - I 

M(Uj a) - {tcT :-Km r/u o t|) 



F - 0C8 1 



ono 



In order to cany out the subset construction the COKVERT progran 
SUBM was formulated. However, on account of the [JSP substrate of CONVERT, 
it appears in tho CTSS time sharing systom of Project MAC as n LISP 
function of no variables, (SUBH). As befits its presence in a time sharing 
interactive ©nvoironrent, the program j»ives instructions fur further data 
to be input during the course of its execution, and in this manner the 
components of the transition system T are specified. The operation of the 
pro fi ra» pay be illustrated by following its development durin E the 
execution of an example, which is the following transition system of nine 
states. 




State number 1 is the only initial state, and is also the only 
accepting state. 

The following is a transcript of the execution of this example, 
annotated to indicate the sequence of steps. 



The gathering of data about the transition system thus follow 
the following course, 

Read in the alphabet 

Read in the stato set 

Read in the list of initial states 

Build the transition table seeding the transitions 
fro- one state at a tine 

Read in the Spontaneous Transition Relation 
Fom its Transitive Closure 
Read in the list of accepting states. 
Thereupon the data concerning the subset machine is calculated^ 
and output as it is determined, 

Detormine the initial state, it is tho set of initial 
states of the transition system together with their 
successor states by the null word, determined from 
C P") . 

Determine the transitions, at the sane time noting the 
new states which arise. 

When all the transitions arc known, the state sot of the 
connected component of the subset machine is known and 
may be printed. 

The accepting states among them may bo determined by 
testing their intersection with the set of accepting states 
of tho transition system. 

At this point the program enters a loop in which one may ask for 

specific transitions which interest him. The program is terminated by 

writing ST; other requests will be unceremoniously rejocted. 
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CTSS will now execute CONVERT 
the ti**e is a signal that CONVERT ie ready 
load the disc fi le SUBM LISP 
syncm/m for (), means the file is loaded 
LISP is asked to execute (SUBM) 

SUBM asks for the alphabet of tfie transition system 
which is a binary alphabet 
SUBM now asks for a list of the states 
B 9) which are numbers, 1 through 9 

SUBM asks for the initial states 
there is but one, but it rust be listed 
SUBM no} asks for the function fl 
requesting first images of state nurier 1 
of which there arc none 
SUBM asks for the image of state nur&er 2 
J carries 2 into 3; carries 2 into 9 

SUBM continues to list the states, to which 
the response is a list whose elements are 
lists; the input letter foll&jcd by the 
states to which that letter causes 
transitions* If there are no itnitge 
states the letter need net be mentioned 
either. Thus if 1 caused transitions to 
states 3, 4, and S, but caused no 
transitions, the response would be 
(<t 3 4 S))< 
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0US TRANSITIONS) 



7) 



(6) (S (0 6) (1 9)) (4) (3) 

SUBM uritea ita internal representation of X 
SUBM not aaka for the epontaneouG transitions 
firet those originating from state nur&er 1 
of which Z ie the only ouch otate 

SUBM continuea Hating the atatea in 
order, and expects in each case a list of 
those states to which the system mrj pass 
spontaneously, indicated in the transition 
diagram by arr&js labelled A. Id there are 
none, the response ie (), 



(TRANSITIVE CLOSURE) 
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SVBM no) computes the transitive and reflexive 
closure of the spontaneous transition 
relation which it has been givsn M and 
prints the reeult M in the form of a lint 
of thoea ot ate 8 accessible from the first 
state of the list, by means of a chain 
of null words. 



SUBN nM asks for a list of the acccptina states 
of which 1 is the only one, but it must be listed 

We n&J begin the output of the description of the 

subset machine 
First the initial state is listed; it is a &ul*set 



(1 
(3 



2)) 
4 5 



followed by the transition function M, tohiclt is 
defined as a table uhich has the form 
f™ ft, — (i t) ™; =^ utiich in 
to say that it is a list of sublists, one for 
each state. One of these subliats begins with 
the none of the state* and contains pairs (i t) 
which state that letter i cax$£* a transition 
to state t. There ie one such pair for each 
letter of the alphabet, 
((3 4 S 7) (0 (4 S 6 7 9)) (1 (1 2 8 9))) 



7))) 

n 

(ORIGINATING FROM (3457)) 



(ORIGINATING FROM (12 8 9)) 

((1 2 8 9) (0 (9)) (1 (3 4 S 7))) 

(ORIGINATING FROM (4567 9)) 

((4 5 6 7 9) (0 (4 5 6 7 9)) (1 (1 2 8 9))) 

(ORIGINATING FROM (9)) 

COO (o ()) (i ())) 

MM 

(ORIGINATING FROM ()) 

10 (o ()) (l ())) 



(STATE SET) 

& 

(4 S 6 7 9) 

(12 8 9) 

(3 4 S 7) 

{1 2) 

(ACCEPTING STATES) 

((i 2 8 9) (1 2)) 

st 

(». GOODBYE ■■■) 

stop 

VALUE 

NIL 

R 13.683*20.650 



The states co^prieing the subset /machine are Hated 



The accepting states arc listed 

SUBM is conwmdad to cease operation 
to which it ivuponds amiably 
LISP ia asked to pease operation 
to which it returns its xmIho 

and CTSS the total elapsed time 



*G- 



With the results so obtained ve nay draw the transition diapron 
* of the subset machine. 




C7> 



fcTTT^) 



Tho CONVERT program itself is a fairly straightforward 
realization of tho theoretical steps necessary for the construction. 
SUBH is primarily a CONVERT ProRram, which calculates in order tho 
quantities of interest after having collected the necessary data. 
Necessary sets are treated as lists, while the transition functions ore 
treatod as tables, and also stored as lists with an approprinte 
structure. This structure is (— (s ... (i t ) »«) .«) ; which iwans 
that for every state s there is a list of transitions headed by the nanc 
of the states. The entry (i t) moans that M(s, i) » t , in the case of 
a machine. In the case of the transition system, the entry would be 
(i ttt) # where the fragment ttt was a list of the elements conprising the 
•jet of images under i; thus tf(s, 1) ■ (ttt). 

There is a problem in tho representation of sets by lists that the 
sots so represented may be equal without the corresponding lists being 
equal, due to the fact that lists more properly represent ordered sets. 
CONVERT does not immediately rocognize set equality; rather than revise 
CONVERT to include a pattern to test equality of sets, it is preferable 
to normalize the lists representing sets. This is accomplished by a 
double complementation, if S is the set and U a subset, the normalized 
fom of U is (-COMP- S |>C0»IP- S U)J. it is to be recalled that -C0MP- 
does not change the order of the survivin£ olements of its argument. 



Tlic argument M of CONVERT, in the definition of SUMI, contains the 
following five entries* 
(•READ*) 



NORM 



(SUSV) 



SUSE 



ADJS 



On account of CONVERT not containing the 
fragment Analogue of "REAP*, which roads 
one "S-Cxprossien" from the telotype console. 

A function (NORN X) which places the elements 
of the subset of states X in their standard 
order, by double complementation. 

A fragment-valued function (St 1ST S I) which 
finds the "successors of o state" wherein S is 
a state, and L is a letter of the Alphabet, 
It first of all finds all those states listed 
in the transition table (*Tlt*) » indues of 
S under L, and then examines the spontaneous 
transition table (*P*) f which is extended to iu 
transitive and reflexive closure in nn initial 
Stftgfl of the proRran, to sec how many further 
states arise fron the ima^c states by spontaneous 
transitions. 

(5USE X I) concatinates the successor states 
for each element of the set S accordine to 
the letter L; it therefore finds the"succe$50r* 
of a set." 



In forming the set of states of the subset nachlno, 
we do not undertako to discover all the subsets of 
the state set of the transition system f but rather 
those of its connected component, which arise fron 
the initial state by some chain of transitions. 
In the process a list of subsets already found is 
kept, ( # S5*). (ADJS X) is used to "adjoin a state" 
to this set. If X is new, it it adjoined to (*5S*). 

The argument I of CONVERT consists of variables S, L, and X; fragments 
XXX and YYY. 

Tlie argument E of CONVERT is vacuous, since the program receives 
its data by moans of «REAi»*. 

The argument R of CONVERT is one single program. Its program 
variables have tho following significance, 

•A- The common alphabet of the transition syston 

and the subset machine. 

■AA» The accepting states of the subset machine. 

■I" The initial states of tho transition system. 

■S» A state of tho subset machine. 

(*S*) Tho state set of the transition system, 

(*SS*) The state set of tho subset machine. 






(•TR-) 

err-} 
CP*) 



The list representing the function N; that is 
the transition mole of the transition syston. 

The list representing the function M. that is, 
the transition table of the subset machine. 



(•F*) 



The spontaneous transition relation, which is 
determined by an -READ- cyclo; for each state 
there are listed that state and its spontaneous 
images. This relation is promptly replaced by 
its transitive closure, union form it retains 
throughout the program. 

Tho list of accepting states of the transition 
system. 

A working variable used in various parts of the 
program. 

Following the program in rough outline we see that it follows tho 
steps seen in the example. The necessary data to define the transition 
systen are collected. A request for the data is written on tho user's 
console, and when it is furnished, is stored in an appropriate program 
variable. No test is made to see if the data furnished is plausible, 
which assumes that a fairly experienced person will use the program. A 
syntactic check could be Included if necessary. Some of the data is 
collected step by step; rather than inputting tho whole transition table 
at once, it is called for state by state, the program volunteering tho 
state name so that none will be overlooked. 

The transitive closure of the spontaneous transition relation is 
calculated by a subprogram. This program uses the program variables 
(II) and (I-). Initially, (-P-) is the input spontaneous ironsltion 
relation, a list whose sublists consist of the different states followed 
by their iiumediato spontaneous imagos. The variable (I) runs through 
these lists, to the current one of which (II) is initially set. Thus 
(II) contains the 1 mediate ima S e of a certain state, s. For each of these 
imediate images, an immediate image is sought, giving second images 
derivable from s not by A, but by *x. Any new states found by this process 
form the set (*'). The new states are adjoined to (II), and the cycle 
repeated with a search for third iaages, and so on. The process stops 
when no new i Mg es are found, and the next state is investigated until none 
»rc left. 
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